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Abstract — We consider a wireless broadcast station that trans- 
mits packets to multiple users. The packet requests for each 
user may overlap, and some users may already have certain 
packets. This presents a problem of broadcasting in the presence 
of side information, and is a generalization of the well known 
(and unsolved) index coding problem of information theory. 
Rather than achieving the full capacity region, we develop a 
code-constrained capacity region, which restricts attention to a 
pre-specifled set of coding actions. We develop a dynamic max- 
weight algorithm that allows for random packet arrivals and 
supports any traffic inside the code-constrained capacity region. 
Further, we provide a simple set of codes based on cycles in the 
underlying demand graph. We show these codes are optimal for 
a class of broadcast relay problems. 



I. Introduction 

Consider a wireless broadcast station that transmits packets 
to TV wireless users. Packets randomly arrive to the broadcast 
station. Each packet p is desired by one or more users 
in the set {1,...,N}. Further, there may be one or more 
users that already have the packet stored in their cache. The 
broadcast station must efficiently transmit all packets to their 
desired users. We assume time is slotted with unit slots t € 
{0,1,2,...}, and that a single packet can be transmitted by the 
broadcast station on every slot. This packet is received error- 
free at all users. We assume that only the broadcast station 
can transmit, so that users cannot transmit to each other. 

If the broadcast station has P packets at time 0, and no 
more packets arrive, then the mission can easily be completed 
in P slots by transmitting the packets one at a time. However, 
this approach ignores the side-information available at each 
user. Indeed, it is often possible to complete the mission in 
fewer than P slots if packets are allowed to be mixed before 
transmission. A simple and well known example for 2 users is 
the following: Suppose user 1 has packet B but wants packet 
A, while user 2 has packet A but wants packet B. Sending each 
packet individually would take 2 slots, but these demands can 
be met in just one slot by transmitting the mixed packet A+B, 
the bit-wise XOR of A and B. Such examples are introduced 
in [1] [2] [3] in the context of wireless network coding. 

The general problem, where each packet is contained as 
side information in an arbitrary subset of the N users, is 
much more complex. This problem is introduced by Birk and 
Kol in [4][5], and is known as the index coding problem. 
Methods for completing a general index coding mission in 
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minimum time are unknown. However, the recent work [6] 
shows that if one restricts to a class of linear codes, then 
the minimum time is equal to the minimum rank of a certain 
matrix completion problem. The matrix completion problem is 
NP-hard in general, and hence index coding is complex even 
when restricted to a simpler class of codes. 

Nevertheless, it is important to develop systematic ap- 
proaches to these problems. That is because current wireless 
cellular systems cannot handle the huge traffic demands that 
are expected in the near future. This is largely due to the con- 
sistent growth of wireless video traffic. Fortunately, much of 
the traffic is for popular content. That is, users often download 
the same information. Thus, it is quite likely that a system 
of 7Y users will have many instances of side information, 
where some users already have packets that others want. This 
naturally creates an index coding situation. Thus, index coding 
is both rich in its mathematical complexity and crucial for 
supporting future wireless traffic. 

The problem we consider in this paper is even more complex 
because packets can arrive randomly over time. This is a 
practical scenario and creates the need for a dynamic approach 
to index coding. We assume there are M traffic types, where 
a type is defined by the subset of users that desire the packets 
and the subset that already has the packets. Let A m be the 
arrival rate, in packets/slot, for type M traffic. We approach 
this problem by restricting coding actions to an abstract set A. 
We then show how to achieve the code constrained capacity 
region A_4, being the set of all rate vectors (\ m )m=i that can 
be supported using coding actions in the set A. The set A_a is 
typically a strict subset of the capacity region A, which does 
not restrict the type of coding action. Our work can be applied 
to any set A, and hence can be used in conjunction with any 
desired codes. However, we focus attention on a simple class 
of codes that involve only bit-wise XOR operations, based 
on cycles in the underlying demand graph. In special cases 
of broadcast relay problems, we show that these codes can 
achieve the full capacity region A. 

The capacity region A is directly related to the conceptually 
simpler static problem of clearing a fixed batch of packets in 
minimum time. Further, index coding concepts are most easily 
developed in terms of the static problem. Thus, this paper is 
divided into two parts: We first introduce the index coding 
problem in the static case, and we describe example coding 



actions in that case. Section III extends to the dynamic case 



and develops two max-weight index coding techniques, one 
that requires knowledge of the arrival rates (A m ), and one 
that does not. The algorithms here are general and can also 
be used in other types of networks where controllers make 
sequences of actions, each action taking a different number of 
slots and delivering a different vector of packets. 
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While the static index coding problem has been studied 
before [6] [5] [4], our work provides new insight even in the 
static case. We introduce a new directed bipartite demand 
graph that allows for arbitrary demand subsets and possibly 
"multiple multicast" situations, where some packets are de- 
sired by more than one user. We also form a useful weighted 
compressed graph that facilitates the solution to the minimum 
clearance time problem in certain cases. This extends the graph 
models in [6], which do not consider the possibility of multiple 
multicast sessions. Work in [6] develops a maximum acyclic 
subgraph bound on minimum clearance time for problems 
without multiple multicast sessions. We extend this bound to 
our general problem using a different proof technique. Further, 
we consider a class of broadcast relay problems for which the 
bound can be achieved with equality. 

The next section introduces index coding in the static case, 
shows its relation to a bipartite demand graph, and presents 



the acyclic subgraph bound. Section III introduces the general 
dynamic formulation and develops our max- weight algorithms. 



Section IV considers an important class of broadcast relay 



networks for which a simple set of codes are optimal. 



II. The Static Minimum Clearance Time Problem 

This section introduces the index coding problem in the 
static case, where we want to clear a fixed batch of packets in 
minimum time. Consider a wireless system with N users, P 
packets, and a single broadcast station. We assume N and P 
are positive integers. Let Af and V represent the set of users 
and packets, respectively: 

Af={l,...,N} , V = {1,...,P} 

The broadcast station has all packets in the set V . Each user 
n E Af has an arbitrary subset of packets H n Q V, and 
wants to receive an arbitrary subset of packets lZ n C V, where 
Hn H = </>, where <f> represents the empty set. Assume 
that all packets consist of B bits, all packets are independent 
of each other, and the B-bit binary string for each packet is 
uniformly distributed over each of the 2 B possibilities. 

We can represent this system by a directed bipartite demand 
graph Q defined as follows (see Fig. [T|: 

• User nodes Af are on the left. 

• Packet nodes V are on the right. 

• A directed link (n, p) from a user node n E Af to a packet 
node p E V exists if and only if user n has packet p. That 
is, if and only if p E W n . 

• A directed link (p, n) from a packet node p E V to a 
user node n E Af exists if and only if user n wants to 
receive packet p. That is, if and only if p E TZ n . 

As an example for the 3-user, 5-packet graph of Fig. [T] the 
have and receive sets for nodes 1 and 2 are: 

Hi ={5} , 72! = {1,2} 

n 2 = 4> , n 2 = {1,2,4} 

We restrict attention to packets that at least one node wants. 
Thus, without loss of generality, throughout we assume the 
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Fig. 1. An example directed bipartite demand graph with 3 users and 5 
packets. 



graph Q is such that all packet nodes p E V contain at least 
one outgoing link. Thus: 



p = {i,...,p} = u£ =1 rc n 



(i) 



In this static problem, the broadcast station has all packets 
in the set V at time 0, and no more packets ever arrive. Every 
slot t E {0, 1,2,.. .} the broadcast station can transmit one 
B-bit message over the broadcast channel. This message is 
received without error at all of the user nodes in the set Af. 
The goal is for the broadcast station to send messages until 
all nodes receive the packets they desire. 

Define a mission-completing coding action with T slots to 
be a sequence of messages that the broadcast station transmits 
over the course of T slots, such that all users are able to decode 
their desired packets at the end of the T slots. We restrict 
attention to deterministic zero-error codes that enable decoding 
with probability 1. The initial information held by each user 
n E Af is given by the set of packets H n (possibly empty). 
Let A4={A4i, . . . , A4t} represent the messages transmitted 
by the broadcast station over the course of the T slot coding 
action. At the end of this action, each node n E Af has 
information {W n ,A4}. Because the coding action is assumed 
to complete the mission, this information is enough for each 
node n to decode its desired packets TZ n . That is, we can 
write: 

{H n ,M} ^ {n n ,M,TZ n } (2) 

where the above represents equivalence in the information 
set, meaning that the information on the left-hand-side can 
be perfectly reconstructed from the information on the right- 
hand-side, and vice versa. Clearly the information on the left 
in |2]) is a subset of the information on the right, and hence 
can trivially be reconstructed. The information on the right in 
(|2) can be reconstructed from that on the left because the code 
is mission-completing. 

For a given graph Q with P packet nodes, define T min (Q) as 
the minimum clearance time of the graph, being the minimum 
number of slots required to complete the mission, considering 
all possible coding techniques. Clearly T m i n {Q) < P. Our 
goal is to understand T m i n (Q). 

For a directed graph, we say that a simple directed cycle of 
length K is a sequence of nodes {ni, 712, . . . , Uk , Tlx} such 
that (rij, n i+ i) is a link in the graph for all i E {1, . . . , K— 1}, 
(riK, Hi) is a link in the graph, and all nodes {n\, . . . , uk } 
involved in the cycle are distinct. For simplicity, throughout 
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this paper we use the term cycle to represent a simple directed 
cycle. We say that the graph Q is acyclic if it contains no 
cycles. Note that directed acyclic graphs have a much different 
structure than undirected acyclic graphs. Indeed, the graph 
in Fig. [T] is acyclic even though its undirected counterpart 
(formed by replacing all directed links with undirected links) 
has cycles. 

Our first result is to prove that if a directed bipartite demand 
graph Q is acyclic, then coding cannot reduce the minimum 
clearance time. This result was first proven in [6] in the case 
without "multiple multicasts," so that each packet is desired 
by at most one user. That result uses an argument based on 
machinery of the mutual information function. It also treats 
a more general case where codes can have errors. Further, 
their proof is developed as a consequence of a more general 
and more complex result. Our work restricts to zero-error 
codes, but allows the possibility of multiple-multicast sessions. 
We also use a different proof technique which emphasizes 
the logical consequences of users being able to decode their 
information. Our proof uses only the following two facts: 

Fact 1: Every directed acyclic graph with a finite number 
of nodes has at least one node with no outgoing links. Such 
a node is called a "leaf" node. 

Fact 2: If the graph contains only one user node, then 
T m i n (Q) = P, where P is the number of packets that this 
user desires. 

Fact [T] follows simply by starting at any node in the graph 
and traversing a path from node to node, using any outgoing 
link, until we find a leaf node (such a path cannot continue 
forever because the graph is finite and has no cycles). Fact [2] 
is a basic information theory observation about the capacity 
of a single error-free link. 

Theorem 1: If the graph Q is acyclic, then T m i n (Q) = P, 
where P is the total number of packets in the graph. 

Proof: See Appendix A. □ 

As an example, because the graph Q in Fig. [T] is acyclic, 
we have T min (Q) = 5. Theorem [T] shows that coding cannot 
help if Q is acyclic, so that the best one can do is just transmit 
all packets one at a time. Therefore, any type of coding must 
exploit cycles on the demand graph. 

A. Lower Bounds from Acyclic Subgraphs 

Theorem [T] provides a simple lower bound on T m i n (Q) for 
any graph Q. Consider a graph Q, and form a subgraph Q' by 
performing one or more of the following pruning operations: 

• Remove a packet node, and all of its incoming and 
outgoing links. 

• Remove a user node, and all of its incoming and outgoing 
links. 

• Remove a packet-to-user link (p,n). 

After performing these operations, we must also delete any 
residual packets that have no outgoing links. Any sequence of 
messages that completes the mission for the original graph Q 
will also complete the mission for the subgraph Q' . This leads 
to the following simple lemma. 



Lemma 1: For any subgraph Q' formed from a graph Q by 
one or more of the above pruning operations, we have: 

PminiQ ) — -^min(^) 

Combining this lemma with Theorem [T] we see that we 
can take a general graph Q with cycles, and then perform the 
above pruning operations to reduce to an acyclic subgraph Q' . 
Then T min (Q) is lower bounded by the number of packets in 
this subgraph. Thus, the best lower bound corresponds to the 
acyclic subgraph generated from the above operations, and 
that has the largest number of remaining packets. Note that 
the above pruning operations do not include the removal of a 
user-to-packet link (n,p) (without removing either the entire 
user or the entire packet), because such links represent side 
information that can be helpful to the mission. 

B. Particular code actions 

Because the general index coding problem is difficult, it is 
useful to restrict the solution space to consider only sequences 
of simple types of coding actions. Recall that coding actions 
must exploit cycles. One natural action is the following: 
Suppose we have a cycle in Q that involves a subset of K 
users. For simplicity label the users {1, . . . , A"}. In the cycle, 
user 2 wants to receive a packet Xi that user 1 has, user 
3 wants to receive a packet X 2 that user 2 has, and so on. 
Finally, user 1 wants to receive a packet Xk that user K has. 
The structure can be represented by: 

1^2^3^...^X^1 (3) 

where an arrow from one user to another means the left user 
has a packet the right user wants. Of course, the users in this 
cycle may want many other packets, but we are restricting 
attention only to the packets Xi,...,Xk- Assume these 
packets are all distinct. 

In such a case, we can satisfy all K users in the cycle with 
the following K — 1 transmissions: For each k g {1, . . . , K — 
1}, the broadcast station transmits a message Ai k =X k + 
X k+1 , where addition represents the mod-2 summation of the 
bits in the packets. Each user k £ {2, . . . , K} receives its 
desired information by adding M. k -i to its side information: 

X k + M k -i = X k + (X fe _! + X k ) = X k ^ 

Finally, user 1 performs the following computation (using the 
fact that it already has packet Xi): 

X 1 +Mi+M 2 + ■■■ + M K -i 

= X ± + (X 1 + X 2 ) + (X 2 + X 3 ) + ...+ {X K -! + X K ) 
= (X! + Xi) + (X 2 +X 2 ) + ... + {X K -i + X K -i) 
+X k 

= x K 

Thus, such an operation can deliver K packets in only K—l 
transmissions. We call such an action a K-cycle coding action. 
We define a 1 -cycle coding action to be a direct transmission. 
Note that 2-cycle coding actions are the most "efficient," 
having a packet/transmission efficiency ratio of 2/1, compared 
to Kj (K — 1) for K > 2, which approaches 1 (the efficiency 
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of a direct transmission) as K — > oo. While it is generally 
sub-optimal to restrict to such cyclic coding actions, doing 
so can still provide significant gains in comparison to direct 
transmission. Further, we show in Section [IV] that such actions 
are optimal for certain classes of broadcast relay problems. 

Another important type of code action takes advantage of 
"double-cycles" in Q: Suppose for example that user 1 wants 
packet A and has packets B and C, user 2 wants packet B 
and has packets A and C, and user 3 wants packet C and has 
packets A and B. Then these demands can be fulfilled with 
the single transmission A + B + C, being a binary XOR of 
packets A, B, C. The efficiency ratio of this action is 3/1. 

III. Dynamic Index Coding 

Now consider a dynamic setting where the broadcast station 
randomly receives packets from M traffic flows. Each flow 
m € {1, . . . , M} contains fixed-length packets that must be 
delivered to a subset Af m of the users, and these packets are 
contained as side-information in a subset S m of the users. We 
assume JV m H S m = <j>, since any user n E Af m who wants 
the packet clearly does not already have the packet as side 
information. In the general case, M can be the number of all 
possible disjoint subset pair combinations. However, typically 
the value of M will be much smaller than this, such as when 
each traffic flow represents packets from a very large file, and 
there are only M active file requests. 

Assume time is slotted with unit slots t E {0, 1,2,.. .}, and 
let A(t) — (Ai(t), . . . , A M (t)) be the number of packets that 
arrive from each flow on slot t. For simplicity of exposition, 
we assume the vector A(t) is i.i.d. over slots with expectation: 

E{A(t)} = \=(\ 1 ,...,\ M ) 

where A m is the arrival rate of packets from flow m, in units 
of packets/slot. For simplicity, we assume that A m (t) E {0, 1} 
for all m and all t, so that at most one new packet can arrive 
per flow per slot. This is reasonable because the maximum 
delivery rate in the system is one packet per slot, and so any 
packets that arrive as a burst can be "smoothed" and delivered 
to the network layer at the broadcast station one slot at a time. 
Packets of each flow m are stored in a separate queue kept 
at the broadcast station, and exit the queue upon delivery to 
their intended users. 

We now segment the timeline into frames, each frame 
consisting of an integer number of slots. At the beginning 
of each frame r, the network controller chooses a coding 
action a[r] within an abstract set A of possible actions. For 
each a E A, there is a frame size T(a) and a clearance 
vector fi(a). The frame size T(a) is the number of slots 
required to implement action a, and is assumed to be a 
positive integer. The clearance vector fi(a) has components 
(;Ui(a), . . . ,^m{o>)), where /i m (a) is the number of type 
m packets delivered as a result of action a. We assume 
/Lt m (a) is a non-negative integer. When frame r ends, a new 
frame starts and the controller chooses a (possibly new) action 
a[r + l] E A. We assume each coding action only uses packets 
that are delivered as a result of that action, so that there is no 
"partial information" that can be exploited on future frames. 



We further assume there are a finite (but arbitrarily large) 
number of coding actions in the set A, and that there are 
positive numbers T max and fJ, max such that 1 < T(a) < T max 
and < fJ.(a) < fi m ax for all a E A. 

Assume that frame starts at time 0. Define t[0] — 0, and 
for r E {0, 1,2,.. .} define t[r] as the slot that starts frame r. 
Let Q[r] — (Qi[r], . . . , QmH) be the queue backlog vector 
at the beginning of each frame r E {0, 1,2,.. .}. Then: 



Q I: 



1] = max[Q m [r] — /i m (a[r]), 0] + arrivals?. 



(4) 



where arrivals m [r] is the number of type m arrivals during 
frame r: 

t[r]+T(a[r])-l 

arrivals m [r]= 2J 4iW (5) 

T=t[r] 

The max[-,0] operator in the queue update equation Q in 
principle allows actions a[r] E A to be chosen independently 
of the queue backlog at the beginning of a frame. In this case, 
if the action a [r] attempts to deliver one or more packets from 
queues that are empty, null packets are created and delivered. 
In practice, these null packets do not need to be delivered. 

Our focus is on index coding problems with action sets A 
defined by a specific set coding options, such as the set of 
all cyclic coding actions. For example, an action a that is a 
2-cyclic coding action that uses packets of type m and k has 
T(a) = 1 and fi(a) being a binary vector with Is in entries 
m and k and zeros elsewhere. However, the above model is 
general and can also apply to other types of problems, such 
as multi-hop networks where actions a E A represent some 
sequence of multi-hop network coding. 

A. The Code-Constrained Capacity Region 
We say that queue Q m [r] is rate stable if: 

lim ^"^j^ = o (with probability 1) 

It is not difficult to show that Q m [R] is rate stable if and only 
if the arrival rate A m is equal to the delivery rate of type m 
traffic [7]. The code-constrained capacity region A_4 is the 
set of all rate vectors (Ai, . . . , Ajw) for which there exists an 
algorithm for selecting a[r] E A over frames that makes all 
queues rate stable. 

Theorem 2: A rate vector A is in the code-constrained 
capacity region if and only if there exist probabilities p(a) 
such that 2~2aeAP( a ) = 1 anc ' : 

Eae^P( a )Mm(Q<) 



A™ < 



Vme {1,...,M} 



(6) 



Proof: The proof that such probabilities p(a) necessarily 
exist whenever A E A^ is given in Appendix B. Below we 
prove sufficiency. Suppose such probabilities p(a) exist that 
satisfy |6j. We want to show that A E A^. To do so, we 
design an algorithm that makes all queues Q m [r] in Q rate 
stable. By rate stability theory in [7], it suffices to design an 
algorithm that has a frame average arrival rate to each queue 
Q m [r] that is less than or equal to the frame average service 
rate (both in units of packets/frame). 
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Consider the algorithm that, every frame r, independently 
chooses action a G A with probability p{a). Let a*[r] 
represent this random action chosen on frame r. Then 
{T(a*[r])}^ is an i.i.d. sequence, as is {p m (a* [r})}^ =0 for 
each m G {1, . . . , M}. By the law of large numbers, the frame 
average arrival rate arrivals m and the frame average service 
JL m (both in packets/frame) are equal to the following with 
probability 1: 

Mm = E{Mm(a*M)} = T, a eAP( a )^rn(a) 
arrivals m = X m E{T(a*[r})} = X m J2 aeA p(a)T(a) 

We thus have for each m G {1, . . . , M}: 

arrivals m _ X m J2 a eAP( a )T( a ) 



where the final inequality follows by (|6j. 



< 1 



□ 



B. Max-Weight Queueing Protocols 

Theorem [2] shows that all traffic can be supported by 
a stationary and randomized algorithm that independently 
chooses actions a*[r] G A with probability distribution p(a). 
This does not require knowledge of the queue backlogs. 
However, computing probabilities p(a) that satisfy (|6]l would 
require knowledge of the arrival rates A m , and is a difficult 
computational task even if these rates are known. We provide 
two dynamic algorithms that use queue backlog information. 
These can also be viewed as online computation algorithms 
for computing probabilities p(a). Both are similar in spirit to 
the max-weight approach to dynamic scheduling in [8], but the 
variable frame lengths require a non-trivial extended analysis. 
Our first algorithm assumes knowledge of the arrival rates A m . 

Max- Weight Code Selection Algorithm 1 (Known A): At 
the beginning of each frame r, observe the queue backlogs 
Q m [r] and perform the following: 

• Choose code action a[r] G A as the maximizer of: 



M 

^2 Qm[r]\Pm(u[r]) 

m— 1 



X m T(a[r})} 



(7) 



where ties are broken arbitrarily. 

• Update the queue equation via Q. 

The next algorithm uses a ratio rule, and does not require 
knowledge of the rates A m : 

Max- Weight Code Selection Algorithm 2 (Unknown A): 
At the beginning of each frame r, observe the queue backlogs 
Q m [r] and perform the following: 

• Choose code action a[r] £ A as the maximizer of: 



M 

E 

m— 1 



Q n 



p m {a[r]) 
T(a[r}) 



(8) 



where ties are broken arbitrarily. 

• Update the queue equation via Q. 

Theorem 3: Suppose that A G A A . Then all queues are rate 
stable under either of the two algorithms above. 

Proof: See Appendix C. □ 

It can further be shown that if there is a value p such that 
< p < 1, and if A G pA A , being a p-scaled version of A_4, 



then both algorithms give average queue size 0(1/(1 — p)). 
Thus, the average backlog bound increases to infinity as the 
arrival rates are pushed closer to the boundary of the capacity 
region. This is proven in Appendix D. 

Define A as the action space that restricts to direct trans- 
missions, 2-cycle code actions, 3-cycle code actions, and the 
1-slot A + B + C code action that exploits double cycles, 



as described in Section II-B Algorithm 2 has a particularly 
simple implementation on action space A and when each 
packet has at most one destination. Indeed, we note that cycles 
can be defined purely on the user set Af, and any candidate 
cycle that involves a user-to-user part i — > j should use a 
packet of commodity m G {1, . . . , M} that maximizes Q m [r] 
over all commodities m that consist of packets intended for 
user j and contained as side information at user i. 



C. Example Simulation for 3 Users 



Average system backlog versus A. 



Uncoded transmission i 




i Algorithm 2 



0.2 0.3 0.4 0.5 

A. 



Fig. 2. Simulation of dynamic index coding for a 3 user system. 

Fig. [2] presents simulation results for a system with N = 3 
users, with action space A as defined above. We consider 
only algorithm 2, which does not require knowledge of rates 
A m , and compare against uncoded transmissions. All packets 
are intended for at most one user. Packets intended for user 
n G {1,2,3} arrive as independent Bernoulli processes with 
identical rates A. We assume each packet is independently in 
the cache of the other two users with probability 1/2. Thus, 
there are four types of packets intended for user 1: Packets 
not contained as side information anywhere, packets contained 
as side information at user 2 only, packets contained as side 
information at user 3 only, and packets contained as side 
information at both users 2 and 3. Users 2 and 3 similarly 
have 4 traffic types, for a total of M = 12 traffic types. 

Each data point in Fig. [2] represents a simulation over 
5 million frames at a given value of A. The figure plots 
the resulting total average number of packets in the system 
(summed over all 12 queues). The case of direct (uncoded) 
transmission is also shown. Uncoded transmission can support 
a maximum rate of A = 1/3 (for a total traffic rate of 1). It 
is seen that algorithm 2 can significantly outperform uncoded 
transmission, achieving stability at rates up to A = 0.57 (for 
a total traffic rate of 1.71). 
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Fig. 3. An illustration of a 2-user broadcast relay system, with the 3 possible 
transmission modes shown. In mode 3, packet P3 is received at both users. 



IV. Broadcast Relay Networks 

Consider now the following related problem: There are 
again N users and a single broadcast station. However, the 
broadcast station initially has no information, and acts as a 
relay to transfer independent unicast data between the users. 
Further, the users only know their own data, and initially have 
no knowledge of data sourced at other users. Time is again 
slotted, and every slot we can choose from one of N+l modes 
of transmission. The first N transmission modes involve an 
error-free packet transmission from a single user to the relay. 
The (7V+l)th transmission mode is where the relay broadcasts 
a single packet that is received error-free at each of the N 
users. Fig.|3]illustrates an example system with 2 users, where 
the 3 possible transmission modes are shown. For simplicity, 
we assume the user transmissions cannot be overheard by other 
users, and the users first send all packets to the relay. The relay 
then can make coding decisions for its downlink transmissions. 

A. The Minimum Clearance Time Relay Problem 

First consider a static problem where a batch of packets 
must be delivered in minimum time. Let Py represent the 
number of packets that user i wants to send to user j, where 
i,j € {1, . . . , iV}. All packets are independent, and the total 
number of packets is P, where: 



y-vjV y^JV 

L^di—\ 2—11 = 



P 



This problem is related to the index coding problem as follows: 
Suppose on the first P slots, all users send their packets to the 
relay on the uplink channels. It remains for the relay to send all 
users the desired data, and these users have side information. 
The resulting side information graph Q is the same as in the 
general index coding problem. However, it has the following 
special structure: The only user that has side information about 
a packet is the source user of the packet. Specifically: 

• Each packet is contained as side information in exactly 
one user. Thus, each packet node of Q has a single 
incoming link from some user that is its source. 

• Each packet has exactly one user as its destination. Thus, 
each packet node of Q has a single outgoing link to some 
user that is its destination. 

This special structure leads to a simplified graphical model 
for demands, which we call the weighted compressed graph 
WC(G) of Q. The graph WC(Q) is formed from Q as follows: 
It is a directed graph defined on the user nodes M only, 
and contains a link (a.b) if and only if the original graph Q 
specifies that user node a has a packet that user node b wants. 



Further, each link (a, b) is given a positive integer weight P &, 
the number of packets user a wants to send to user b. It is easy 
to show that WC(G) is acyclic if and only if Q is acyclic (see 
Appendix E). Hence, coding can only help if WC(G) contains 
cycles, and so T m i n {Q) = P whenever WC(G) is acyclic. 

We say the weighted compressed graph WC(Q) has disjoint 
cycles if each link participates in at most one simple cycle. An 
example is shown in Fig. [4] Consider such a graph that has 
C disjoint cycles. Let w™ m be the min-weight link on each 
disjoint cycle c 6 {1, . . . , C}. 

Theorem 4: If the broadcast relay problem has a weighted 
compressed graph WC(G) with disjoint cycles, then: 



T m m{Q)=P-Y%= 1 w T c 



(9) 



and so the full clearance time (including the P uplink transmis- 
sions) is the above number plus P. Further, optimality can be 
achieved over the class of cyclic coding actions, as described 
in Section IH-BI 

As an example, the graph WC(Q) in Fig. [4^ has P = 48, 
three disjoint cycles with w™ 112 = 4, id™ 71 = 4, w™ m = 1, 



4, w^ m 

and so T min (Q) = 48 - 4 - 4 - 1 = 39. 





Fig. 4. (a) A graph WC(Q) with three disjoint cycles, and (b) its pruned 
graph WC(g r ). 

Proof: First prune the graph WC(Q) by removing the 
min-weight link on each of the disjoint cycles (breaking ties 
arbitrarily). This corresponds to removing those packets from 
the original graph Q, to produce a new graph Q' with exactly 
P — X2 c =i W T" 1 packets. The weighted compressed graph 
WC(Q r ) is the subgraph of WC(Q) with the min-weight links 
on each disjoint cycle removed (see Fig.|4^ and Fig.|4|)). Both 
0' and WC(G') are acyclic, and so: 



It remains only to construct a coding algorithm that achieves 
this lower bound. This can be done easily by using u;™ m 
separate cyclic coding actions for each of the disjoint cycles 
(using a fc-cycle coding action for any cycle of length k), and 
then directly transmitting the remaining packets. □ 

B. Traffic Structure and Optimality of Cyclic Coding 

Suppose we have a broadcast relay problem with N users, 
packet matrix (Pij), and with the following additional struc- 
ture: Each user i <E {1, . . . , N} wants to send data to only 
one other user. That is, the matrix (Pij) has at most one non- 
zero entry in each row i € {1, . . . ,N}. We now show that 
the resulting graph WC(Q) has disjoint cycles. To see this, 
suppose it is not true, so that there are two overlapping cycles. 
Then there must be a shared link (a, b) that continues to a link 
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(b, k) for cycle 1 and (b, in) for cycle 2, where k ^ m. This 
means node b has two outgoing links, a contradiction because 
matrix (Py) has at most one non-zero entry in row b, and 
hence at most one outgoing link from node b. We conclude 
that WC(Q) has disjoint cycles, and so cyclic coding is optimal 
via Theorem H] 

A similar argument holds if each user wants to receive from 
at most one other user, so that (Py ) has at most one non-zero 
entry in every column. Again, WC{Q) has disjoint cycles, and 
so cyclic coding is optimal. 

C. Dynamic Broadcast Relay Scheduling 

Now consider the dynamic case where packets from source 
user i and destination user j arrive with rate Ay packets/slot. 
Suppose we have an abstract set of coding actions A, where 
each action involves a subset of packets, and first transmits 
these packets to the relay before any coding at the relay. 
Let T(a) be the number of slots to complete the action, and 
(fiij(aj) be the matrix of packets delivered by the action. 
It can be shown that capacity can be approached arbitrarily 
closely by repetitions of minimum-clearance time scheduling 
on large blocks of the incoming data (similar to the capacity 
treatment in [9] for a limit of large packet size). Hence, if 
T m in(G) can be optimally solved using only cyclic-coding 
actions, then capacity is also achieved in the max-weight 
algorithms when A is restricted to cyclic-coding actions. It 
follows that such actions are optimal for rate matrices (Ay) 
with at most one non-zero entry per row, and for rate matrices 
(Xij) with at most one non-zero entry per column. 

D. Counterexamples 




Fig. 5. Two example graphs WC(6) for broadcast relay problems. 

Can we minimize clearance time by grabbing any available 
2-cycle, then any available 3-cycle if no 2-cycle is available, 
and so on? Not necessarily. A simple counterexample is shown 
in Fig. [5^. The graph has 5 users and 7 packets {A, . . . , G}, 
where each link has a single packet. Using the middle 3-cycle 
2^3^4^2by transmitting B + D and D + E leaves 
a remaining acyclic graph with 4 packets, and hence would 
take 4 more transmissions, for a total of 6 slots. However, 
using the two side cycles (with 2 transmissions each) and then 
transmitting the remaining packet E clears everything in 5 
slots, which is optimal because the maximum acyclic subgraph 
has 5 packets (just remove links B and D). 

One may wonder if all broadcast relay graphs can be opti- 
mally cleared with cyclic coding. We can show this is true for 
N — 2 and N = 3 (see Appendix E and F). However, this is 
not true in general for N > 3. Fig. shows a counterexample 



with N = 6. Suppose each link has a single packet, so that we 
have 9 packets {A, . . . , I}. It can be shown that the maximum 
acyclic subgraph has 7 packets, and so T min (Q) > 7, but 
the best cyclic coding method uses 8 slots. Here is a way to 
achieve 7 slots: Send messages Mi = E+G+F, M 2 = H+E, 
M 3 = H + D, Ah = A + B + H, M 5 = C + B, M 6 = C + G, 
Mi = C + I + D. The decodings at users 2,3,4,5,6 are 
straightforward by combining their side information with just 
a single message. The decoding at user 1 is done as follows: 
Mi + M 2 + M 3 + M 6 + M 7 = F + I. Since user 1 knows F, 
it can decode /. M3 + M4 + M5 + M7 = A + I, since user 1 
knows I it can get A. 

V. Conclusions 

This work presents a dynamic approach to index coding. 
This problem is important for future wireless communication, 
where there may be many instances of side information that 
can be exploited. While optimal index coding for general prob- 
lems seems to be intractable, we develop a code-constrained 
capacity region, which restricts actions to a pre-specified set 
of codes. Two max-weight algorithms were developed that 
can support randomly arriving traffic whenever the arrival 
rate vector is inside the code-constrained capacity region. The 
first algorithm requires knowledge of the rate vector, and the 
second does not. Simulations verify network stability up to the 
boundary of the code-constrained capacity region, and illus- 
trate improvements in both throughput and delay over uncoded 
transmission. For coding to provide gains in comparison to 
direct transmission, it must exploit cycles in the demand graph. 
A simple set of codes based on cycles was considered and 
shown to be optimal (so that the code constrained capacity 
region is equal to the unconstrained capacity region) for certain 
classes of broadcast relay networks. These results add to the 
theory of information networks, and can be used to improve 
efficiency in wireless communication systems. 

Appendix A — Proof of TheoremQ] 

Proof: (Theorem [TJ We already know that T min (Q) < P. 
It suffices to show that T m i n (Q) > P. Consider any mission- 
completing coding action that takes T slots. We show that 
T > P. Let M. be the sequence of messages transmitted. 
Then every node n E TV is able to decode its desired packets, 
being packets in the set lZ n , from the information {H n , M}, 
being the information it has at the end of the coding action. 
That is, we have: 

{H n ,M} {H n ,M,K n }Vn£{l,...,N} (10) 

Because the graph is acyclic, there must be at least one 
node with no outgoing links (by Fact[T]i. Choose such a node, 
and label this node n\. The node n\ cannot be a packet node, 
because we have assumed that all packet nodes have outgoing 
links. Thus, n\ G TV. Because node n\ has no outgoing links, 
it has H ni = <fi and thus has no initial side information about 
any of the packets. Thus, it is able to decode all packets in 
the set 1Z ni by the messages M. alone. That is: 

M {M,K ni } (11) 



8 



We want to show that this node ri\ can decode all packets in 
the set P, so that: 



M 



{M,P} 



(12) 



If we can show that ( p"2| ) holds, then the sequence of messages 
M. is also sufficient to deliver P independent packets to node 
rii, and node n\ did not have any initial side information about 
these packets. Thus, the number of slots T used in the coding 
action must be at least P by Fact [2] proving the result. Thus, 
it suffices to prove \Y1) . 

We prove ( |T2] > by induction on k, for k £ {1, .. ., N — 
1}: Assume that there is a labeling of k distinct user nodes 
{ni,n 2 , ■ ■ ■ , nk} such that: 



{M} 



{M,n ni ,...,n nh } 



(13) 



This property holds for the base case k — 1 by (jTTJl. We now 
assume that ( fT3j ) holds for a general fee {1, . . . , N — 1}, and 
prove it must also hold for k + 1. Take the graph Q, and delete 
the user nodes {n\, . . . , rik}, also deleting all links outgoing 
from and incoming to these nodes. This may create packet 
nodes with no outgoing links: Delete all such packet nodes. 
Note that all deleted packet nodes (if any) must be in the set 
{lZ ni , • ■ ■ , P-n k }, being the set of packets desired by the users 
that are deleted. The resulting subgraph must still be acyclic, 
and hence it must have a node rik+i with no outgoing links. 
This node must be a user node, as we have deleted all packet 
nodes with no outgoing links. 

Because the user node n^+i has no outgoing links, it 
either had H nk+1 = 4> ( so that it never had any outgoing 
links), or all of its outgoing links were pointing to packet 
nodes that we have deleted, and so those packets were in 
the set {TZ ni , . . . ,H nk }. That is, we must have H„ fc+1 C 
{Ti ni Tl nk }. Therefore: 



{M,H nk+1 }C{M,K ni ,...,n nk } 



(14) 



However, at the end of the coding action, node n^+i has 
exactly the information on the left-hand-side of (14) , and hence 
this information is sufficient to decode all packets in the set 
P~n k+1 - Thus, the information on the right-hand-side of ( fT4| ) 
must also be sufficient to decode 1Z nh+1 , so that: 

{M,K ni , . . . ,K nh } ^ {M,K ni ,...,K nh ,K nk+1 } 

But this together with ( fT3| ) yields: 



{M} 



{M,n ni ,...,n nk ,n nh+1 } 



which completes the induction step. 

By induction over k € {1, . . . , N — 1}, it follows that: 

{M} ^ {M,K ni ,...,K nN } (15) 

However, by re-labeling we have: 

{K ni ,...,K nN } = {K 1 ,...,K N } = V (16) 

where the final equality holds by ([TJ. Combining ([15) and ( fTo*) 
proves (|12). □ 



Appendix B - Proof of Necessity for Theorem[2] 

Let {a[r]}^ be a sequence of actions, chosen over frames, 
that makes all queues Q m [r] rate stable. We show there must 
exist probabilities p(a) that satisfy |6]). For each positive 
integer R and each m € {1, ... , M}, define a m [i?] and JI m [R] 
as the following averages over the first R frames: 



a m [R] 4 



~E E r =o arrival 



where arrivals m [r] is defined in |5]). Now define P(a,R) 
as the set of frames r € {0, . . . , R — 1} that use action a, 
and define |J-"(a,i?)| as the number of these frames, so that 
T, a eA l^tei R)\ = R- We then have: 



a m [R] - Y, a eA 
1 



\?(<*,R)\ 

R 



\T(a,R)\ ^ , 



arrivals m [r] (17) 



Hm[R\ = 22 a eA ^ V™(a) 



(18) 



The set A is finite. Thus, the values {(\T(a, H)\/ R)}^ =1 can 
be viewed as an infinite sequence of bounded vectors (with 
dimension equal to the size of set .4) defined on the index R e 
{1, 2,3,.. .}, and hence must have a convergent subsequence. 
Let Rk represent the sequence of frames on this subsequence, 
so that there are values p(a) for all a £ A such that: 

lim \T(a,R k )\/R k = p(a) 
Further, by ( fT8| we have for all me {1, . . . , M}: 

lim jL m [Rk] = J2 a eAP( a )^n(a) 



k— >oo 



(19) 



Likewise, from ( fTT) and the law of large numbers (used over 
each a 6 A for which lim / t_ i . 00 \ JF(a, Rk)\ = oo, and noting 
that arrivals m [r] is i.i.d. with mean T(a)X m for all r € 
P(a,R)) we have with probability 1: 



lim a m [Rk] = V" p(a)T(a)X r , 



(20) 



a£A 



Because \P(a, Rk)\/Rk > for all a E A and all R^, and 
J2aeA \-F{ a i Rk)\/Rk = 1 for all Rf., the same holds for the 
limiting values p(a). That is, p(a) > for all a € A, and: 
J2aeAP( a ) = 1- B ecause eacn queue Q m [r] is rate stable, 
we have with probability 1 that for all m E {1, ... , M}: 



lim 

k— f Co 



Qm[Rk] 

Rk 







(21) 



However, from the queue update equation Q we have for all 
re {0,1,2,...}: 

Q m [r + 1] > Q m [r] - fi m (a[r]) + arrivals m [r] 

Summing the above over r 6 {0, 1, . . . , Rk — 1} and dividing 
by Rk yields: 



Qm[Rk]-Qm[0] 

Rk 



> -fj, m [R k ] + a m [R k ] 
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Taking a limit as k — » oo and using (fT9]l-(|2T| yields: 

> - E« e ^J 3 ( a )Mm(«) + A m p(a)T(a) (22) 



This proves the result. 



Appendix C — Proof of Theorem[3] 

We first prove rate stability for Algorithm 2, which uses a 
ratio rule. The proof for Algorithm 1 is simpler and is given 
after. We have the following preliminary lemma. 

Lemma 2: (Sufficient Condition for Rate Stability [10]): 
Let Q[r] be a non-negative stochastic process defined over 
the integers r £ {0, 1,2,.. .}. Suppose there are constants B, 
C, D such that for all frames r £ {0, 1,2,.. .} we have: 



E{(Q[r 



i]-QM) 2 } 

E{Q[r] 2 } 



< 
< 



D 
Br 



C 



(23) 
(24) 



Then lim r _ i . 00 Q[r]/r = with probability 1. 

The condition ( |23"j ) is immediately satisfied in our system 
because the queue changes over any frame are bounded. Thus, 
to prove rate stability, it suffices to show that ( f24] > holds for all 
queues and all frames. That is, it suffices to prove the second 
moment of queue backlog grows at most linearly. 

For each frame r £ {0,1,2,...}, define the following 
quadratic function L[r], called a Lyapunov function: 

Define the conditional Lyapunov drift A[r] to be the expected 
change in L[r] from one frame to the next: 

A[r]iM{L[r + l]-L[r]\Q[r]} 

where Q[r] = (Qi[r], . . . , QmH) is the queue backlog vector 
on frame r. The above conditional expectation is with respect 
to the random arrivals over the frame and the (possibly 
random) coding action chosen for the frame. 

Lemma 3: Under any (possibly randomized) decision for 
a[r] £ A that is causal (i.e., that does not know the future 
values of arrivals over the frame), we have for each frame r: 

A[r] < B + Ero=iQmME {X m T(a[r}) - ^ m (a[r])\Q[r]} 

where B is a finite constant that satisfies: 

B > iE™=l E { a ™«^mW 2 +Mm(«H) 2 |QW} 

Such a finite constant B exists because frame sizes are 
bounded, as are the arrivals per slot. 

Proof: For simplicity of notation, define b m [r]=fx m (a[r]), 
and a m [r]=arrivals m [r]. The queue update equation is thus: 

Q m [r + 1] = max[Q m [r] - b m [r},0] + a m [r] 

Note that for any non-negative values Q, a, b we have: 

(max[Q - 6, 0] + a) 2 <Q 2 + b 2 + a 2 + 2Q{a - b) 

Using this and squaring the queue update equation yields: 

Q m [r+l} 2 < Q m [r} 2 +b m [r} 2 +a m [r] 2 +2Q m [r}[a m [r}-b m [r}} 



Summing over all m, dividing by 2, and taking conditional 
expectations yields: 

A[r] < B + EiLiQm[r]E{a m [r] - b m [r]\Q[r}} (25) 

Now note thatQ 

!t[r]+T(a[r])-l 
A m (r)\Q[r] 
r=t[r] 

= E{X m T[r}\Q[r}} (26) 



Plugging this identity into ( |25[ l proves the result. □ 
We now prove that Algorithm 2 yields rate stability. 
Proof: (Theorem [3] — Stability Under Algorithm 2) Suppose 

that Algorithm 2 is used, so that we choose a[r] every frame 

r via ((8). We first claim that for each frame r and for all 

possible Q[r] we have: 



v{E™=iQmlr]»m(a[r])\Q[r]} 

E{T(a[r])\Q[r}} : 
E {Em=i Qrn[r]v m {a*[r))\Q[r]j 



E{T(a*[r])\Q[r]} 



(27) 



where a* [r] is any other (possibly randomized) code action 
that could be chosen over the options in the set A. This can 
be shown as follows: Suppose we want to choose a[r] £ A 
via a possibly randomized decision, to maximize the ratio of 
expectations in the left-hand-side of ( f27] >. Such a decision 
would satisfy ( f27] > by definition, since it would maximize 
the ratio of expectations over all alternative policies a*[r]. 
However, it is known that such a maximum is achieved via a 
pure policy that chooses a particular a £ A with probability 
1 (see Chapter 7 of [7]). The best pure policy is thus the one 
that observes the queue backlogs Q\r\ and chooses a[r] £ A 
to maximize the deterministic ratio, which is exactly how 
Algorithm 2 chooses its action (see ([H}). 
Thus, (|27) holds. We can rewrite ( |2"7| ) as: 



E {Em=i Qm[r]fJ> m (a[r])\Q[r]j 
E{T(a[r])\Q[r}} 



> 



E{^ m (a*[r])\Q[r}} 
1 E{T(a*[r])\Q[r}} 



(28) 



We can thus plug any alternative (possibly randomized) de- 
cision a*[r] into the right-hand-side of ( |28] i. Consider the 
randomized algorithm that independently selects a £ A 
every frame, independent of queue backlogs, according to 
the distribution p(a) in Theorem [2] Let a* [r] represent the 
randomized decision under this policy. Then from |6]) we have 
for all m€ {I,.. . , M}: 

E{p, m (a*[r])} E{p, m (a*[r])\Q[r]} 
Am ~ E{T(a*[r))} E {T(a* [r])\Q[r]} ( " } 



Equality [26) uses causality and the i.i.d. nature of the arrival process. It is 
formally proven by conditioning on T(a[r]) and using iterated expectations. 
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where the last equality holds because a* [r] is chosen indepen- 
dently of Q[r\. Using this in ( |28j > yields: 



E{E™=iQmMMm(a[r])|Q[r]} M 

E{T(a[r])\Q[r}} " l^® m ^ Xm 

Rearranging terms above yields: 

E%=iQm[r]E{\ m T(a[r]) - fi m (a[r])\Q[r]} < (30) 

Plugging ([30} into the drift bound of Lemma [3] yields: 

A[r] < B 

Taking expectations of the above and using the definition of 
A[r) yields: 

E{L[r + 1]} -E{L[r]} <5Vr€ {0,1,2,...} 
Summing the above over r G {0, 1, . . . , R — 1} yields: 

E{L[R}}-E{L[0}} < BR 
and hence for all R > 0: 

Em=i E {QmW < 2E {L[0]} + 2BR 

Thus, the second moments of all queues grow at most linearly, 
from which we guarantee rate stability by Lemma [2] □ 

We now prove that Algorithm 1 yields rate stability. 

Proof: (Theorem [3] — Stability Under Algorithm 1) Note 
that Algorithm 1 is designed to observe queue backlogs Q[r] 
every frame r, and take a control action a[r] € A to minimize 
the right-hand-side of the drift bound in Lemma [3] Therefore, 
we have: 

M 

A[r] < B + J2 Q m [r]E{X m T(a*[r]) - » m (a*[r])\Q[r}} 

m— 1 

where a*[r] is any other (possibly randomized) decision. If 
a*[r] makes a decision independent of Q[r] we have: 

M 



Appendix D — Proof of the Queue Size Bound 

Here we show that if A € pA_4, where < p < 1, then both 
Algorithm 1 and Algorithm 2 yield finite average backlog of 
size 0(1/(1 - p)). 

Proof: (Queue Bound for Algorithm 1) Because A <E pA_4, 
we have: 

{\m/p) G A.4 

Thus, from Theorem [2] there is a randomized algorithm a* [r] 
that makes decisions independent of queue backlogs to yield 
the following for all m € {1, . . . , M}: 



Am < E{p m (a*[r])} 
p ~ E{T(a*[r})} 



(32) 



Define T*=E {T(a* [r])}. Using this and rearranging the 
above gives: 

E{p m (a*[r})} > X m T*/p Vm E {1, . . . , M} 

Substituting the above into ( |3~Tj ) gives: 

M 

A[r] < B + Q m [r]T*X m (l - 1/p) 

m— 1 

Taking expectations and using the definition of A[r] gives: 
E{L[r + l]}-E{L[r}} < B+J2^{Qm[r}}T*X m (l-l/p) 

m— 1 

Summing over r e {0, . . . , R — 1} (for any integer R > 0) 
gives: 

E{L[i?]}-E{i[0]} < BR+Y, E E {g m [r]}T* A m (l-l/p) 

r— m— 1 

Using the fact that E{L[r]} > and E{i[0]} = 0, dividing 
by R, and rearranging terms gives: 

r=0 m=l V P ' 

Because T* > 1, the above bound can be simplified to 
Bp/(1 — p). The above holds for all R, and so the expected 
queue backlog is 0(1/(1— p)). Further, from [7] we can derive 
that the following holds with probability 1: 

R-l M 



A[r] <B+Y Q m [r}E{X m T(a*[r})-» m (a*[r})} (31) lim I y y XMr] < Bp_ 



Consider again randomized algorithm a* [r] that independently 
and randomly selects an action in A every frame, independent 
of queue backlogs, according to the distribution p(a) in Theo- 
rem H Then ((29) again holds, so that for all m £ {1, . . . , M}: 

E{X m T{a*[r]) - fi m (a*[r})} <0 

Substituting the above into the right-hand-side of ( (31) gives: 

A[r] < B 

from which we then obtain rate stability in the same way as 
in the proof for Algorithm 2. □ 



□ 



P) 



Proof: (Queue Bound for Algorithm 2) Recall that ((28) 
holds for every frame r and all possible Q[r] for Algorithm 2. 
Using a* [r] as an algorithm that makes randomized decisions 
on frame r that are independent of Q[r] gives: 

®{E%=iQrn[r}Vm(a[r])\Qir]} 

E{T(a[r])\Q[r}} " 

Eli Qm H E {v™ (a* H ) I m . 

E{T(a*[r])} 1 ; 

where we have removed the conditional expectations on the 
right-hand-side. Because (X m /p) E A_4, we know that there 
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is an algorithm that makes independent and randomized deci- 
sions to yield ([32|. Plugging ( [32] ) into the right-hand-side of 
d33l gives: 



E {Em=i Q m [r]fJ>m{a[r])\Q[r}} 
E{T(a[r])\Q[r}} 

Rearranging gives: 

M 



M 

> ^ Q?n[r]\ m /p 

m—1 



J2Q m [r}E{p m (a[r])\Q[r}}> 

m—1 

M 

- Qm[r]E{\ m T(a[r])\Q [r] } 



Using this in the drift bound of Lemma [3] gives: 



A I 



A, 



A[r] < B + ]T Q m [r]E I X m T(a[r\) - -^T{a[r])\Q[r] 

m—1 ^ ^ 

That is: 

M 

A[r] < B + Qm[r}X m {l-l/p)E{T(a[r})\Q[r]} 



m—1 
M 



< B+ <9mHA m (l-l/p) 



where we have used the fact that E {T^r])!^?-]} > 1, and 
1 — 1/p < 0. We thus have by the same argument as in the 
previous proof that for any R > 0: 



R-l M 



^J2J2 A m E {Q m [r]} < - 



P B 



R 

r— m—1 

and with probability 1: 

R-l M 



lim 



r— rn— 1 



□ 



Appendix E — The Disjoint Cycle Theorem for 
General Index Coding 

The weighted compressed graph WC(Q) was introduced in 
Section [IV] for the context of broadcast relay networks, being 
special cases of index coding problems where each packet 
has exactly one incoming link and exactly one outgoing link. 
However, such a graph is also useful for general index coding 
problems, with general directed bipartite graphs Q. 

Consider any such general directed bipartite graph Q with 
user nodes M and packet nodes V. Define WC{Q) similarly: It 
is a graph on the user nodes J\f, a link (i, j) exists if and only 
if user i has a packet as side information that is wanted by 
user j, and each link is weighted by Py, the (integer) number 
of distinct packets of this type. An example of a graph Q and 
its weighted compressed graph WC(Q) is given in Fig. [6] 

Note that weighted compressed graphs in this case typically 
include less information than the original graph Q. For exam- 
ple, the existence of a link (a, b) with weight P a b in the graph 
WC(G) tells us that node a has P a b distinct packets that are 




User set 'N 



Packet set 'P 



Weighted Compressed 
Graph ~WC(g) 



Fig. 6. An example graph Q and its weighted compressed graph WC{Q). 



wanted by node 6, but does not specify which packets these 
are, or if these packets also take part in the weight count on 
other links of WC(Q). Indeed, in the example of Fig. [6] packet 
C is the packet corresponding to the weight 1 on link (1,2) in 
WC(G), and packets C and D correspond to the weight 2 on 
link (1,3) in WC(G), so that the same packet C is included 
in the weight for two different links. Note also that, unlike 
broadcast relay networks, the sum of the weights of WC(G) 
is not necessarily equal to the number of packets P in Q. 

Lemma 4: The original demand graph Q is acyclic if and 
only if WC(Q) is acyclic. 

Proof: We show that Q has cycles if and only if WC(G) 
has cycles. The proof can be understood directly from Fig. [7] 
Suppose that WC(G) has a cycle of length K, as shown in Fig. 
[7^. Relabeling the nodes, this cycle uses nodes {1, . . . . K}, 
and has links {(1, 2), (2, 3), ... , (K-1,K), {K, 1)}. However, 
for each link (a, b) of this cycle, there must be a packet x a t E 
V such that x a b G 7~La ( so the graph Q has a directed link from 
user node a to packet node x a b), and x a t, £ IZb (so the graph 
Q has a directed link from packet node x a b to user node b). 
This means that the original graph Q has a cycle, as depicted 
in Fig. [Ji. 

Conversely, if the bipartite graph Q has a cycle, it must 
alternate between user nodes and packet nodes, with a structure 
as depicted in Fig. From this, it is clear that WC{Q) also 
has a cycle. □ 




{a) A cycle of size K on "WC(0. 




(b) A cycle of size 2K on £j. 



Fig. 7. An illustration for the proof of Lemma [4] 

Lemma |4] makes it easier to see whether or not a given 
graph Q is acyclic. That the graph Q in Fig. [6] has cycles 
is immediately apparent from its much simpler weighted 
compressed graph WC(Q), which has a single cycle of size 
2 consisting of nodes 1 and 3. As another example, it may 
not be immediately clear that the graph Q in Fig. [T]is acyclic. 
However, its graph WC(Q) has only 3 nodes J\f = {1,2,3} 
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and two links (1,3) and (3,2), and from this the acyclic 
structure is obvious. These compressed graphs are also useful 
for coding, even when they have cycles, as shown next. 

We say that a packet is a unicast packet if it has at most one 
outgoing link (so that it is intended for only one destination 
user), and a packet is a multicast packet if it is intended for 
more than one destination user. We say that the weighted 
graph WC(Q) has disjoint cycles if no link participates in 
more than one cycle. Suppose now that WC(G) has disjoint 
cycles, and let K be the number of such cycles. As before, for 
each k € {1, . . . ,K}, we let represent the set of links 

(k) 

for the kth cycle, and let w min represent the weight of the 
minimum weight link in C^ k K Note that removing this link 
from each cycle results in a new graph that is acyclic. This is 
used in the following theorem. 

Theorem 5: Let Q be a demand graph with N nodes and 
P packets. Suppose that WC(Q) has disjoint cycles, that there 
are K such cycles, and that all packets of these cycles are 
distinct and are unicast packets. Then: 

K 

where P is the number of packets in Q. Furthermore, the 
minimum clearance time can be achieved by performing cyclic 
coding w^in times for each cycle k g {1, . . . , K}, and then 
transmitting all the remaining packets without coding. 

Proof: For each cycle k E {l,...,K}, select a link 

(k) 

with a link weight equal to the minimum link weight w min 
(breaking ties arbitrarily). Let T>( k > be the set of all packets 
associated with this link. All packets in U^ =1 V^- k ' are distinct 
(by assumption), and the total number of these packets is: 

|uL^ (fe) l = f>it 

fc=i 

Now consider the subgraph Q' formed from Q by removing 
all packet nodes in the set U^ =1 V^ k '. The number of packets 
P' in this graph is thus: 

k=l 

Further, we have T min {Q') < T min {Q). However, note that 
WC(Q') is the same as WC{Q), with the exception that the 
min-weight link on each of the K cycles has been removed. 
Thus, WC(Q') is acyclic, so that Q' is acyclic, and by Theorem 
[T]we have T min {G') = P'. It follows that: 

T mm {g) > p' 

Thus, any algorithm for clearing all packets in the graph Q 
must use at least P' slots. We now show that T min (G) = P' by 
designing a simple cyclic coding scheme to clear all packets 
in the original graph Q in exactly P' slots. By assumption, all 
packets that participate in cycles are distinct unicast packets, 
and hence they only need to be delivered to one destination, as 
defined by the link of the cycle they are in. This includes all 
packets in the set U^Lj^T 3 '' '. We now use the obvious strategy: 
For each cycle k G {1, . . . , K}, we choose an undelivered 



packet p 6 V^ k \ and use a cyclic coding operation that clears 
distinct packets (including packet p) in S 1 -^ — 1 slots, 
where S 1 -^ is the size of cycle k. This can be done because 
packet p is from the link of cycle k with the fewest packets, 
and so there are always remaining packets on the other links 
of the cycle to use in the coding. Once this is done for all 
cycles and for all packets on the min-weight link of each cycle, 
we then transmit the remaining packets uncoded. The total 
number of transmissions is thus equal to P minus the savings 
°f SfcLi w mln from all of the cyclic coding operations. Thus, 
this takes exactly P — J2k=i w min = P' slots. □ 
Corollary 1: If the demand graph Q has P packets, but only 
two users (so that M = {1, 2}), then: 

Pmini.G) P ^min 

where w m i n = min[P 12 , P21], being the weight of the min- 
weight link in the graph WC(Q). Further, this minimum 
clearance time can be achieved by performing cyclic coding 
over the packets associated with this min weight link, and then 
transmitting (uncoded) all remaining packets. 

Proof: It suffices to show that WC(Q) has disjoint cycles, 
and that all packets that are part of these cycles are distinct 
unicast packets. The graph WC(Q) has only 2 nodes and hence 
at most one cycle, so it clearly satisfies the disjoint cycle 
criterion. All packets of the cycle are clearly distinct. Indeed, 
two packets on different links of the cycle must have one 
packet on link (1,2) and the other on link (2, 1), and hence 
the first is desired by user 2, while the second is not desired 
by user 2 (so they cannot be the same packet). It remains to 
show that all packets in the cycle are unicast packets. If there 
are no cycles, we are done. If there is a cycle, this involves 
link (1, 2) and link (2, 1). Any packet associated with the link 
(1,2) must already be in the set of packets that node 1 has, 
and so this packet only requires transmission to node 2 (not to 
node 1). Thus, this packet must be a unicast packet. Similarly, 
any packet associated with link (2, 1) must be a unicast packet. 

□ 

Appendix F — Optimality of Cyclic Coding for 
Relay Networks with N = 3 

Consider now the special case of broadcast relay networks, 
where each packet node of the graph Q has exactly one 
incoming link and one outgoing link. There are N = 3 users 
and P packets (where P is a positive integer). This can be 
exactly represented by the weighted compressed graph WC(G) 
with link weights Py, where P = J^iLi S^Li Pij- We want 
to show that minimum clearance time can be achieved by 
cyclic coding (using either direct transmission, 2-cycle code 
actions, or 3-cycle code actions). If the 3-node graph WC(G) 
has disjoint cycles, we are done (recall Theorem [4}. 

Consider now the general case with possibly non-disjoint 
cycles, as shown in Fig. [8] To represent the general case, we 
allow link weights P a j, to possibly be (a weight of is 
equivalent to the absence of a link). First define mm^, minu,, 
min-ix as the weight of the min-weight link for each of the 
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Initial graph WC(Gj) 


Case 1 (example) 






[ A 




hi 




Case 2 (parti) 


Case 2 (part 2) 



Fig. 8. An illustration of the general broadcast relay graph WC(Q) with 
TV = 3 users, and the two cases required for the proof. 



three possible 2-cycles: 

min\2 = min[Pi2,P2i] 
min 23 = mm[P 23 ,P 32 ] 
min 31 = mm[P 3 i,Pi 3 ] 

Now prune the graph WC(Q) by removing the min-weight 
link for each 2-cycle. This results in a graph WC(G r ) with 3 
nodes and (at most) 3 links, and with a total number of packets 
equal to P — minyx — min,23 — min^i, as shown in Cases 1 
and 2 in the figure. We have two cases. 

Case 1: The resulting graph WC(Q') is acyclic. An example 
of this case is shown as case 1 in Fig. [8] Thus, we know 
Tmin(G) > T mm (Q') = P - min 12 - mm 23 - min zl . 
However, it is easy to see this clearance time bound can be 
achieved by using 2-cycle code actions on each of the three 

2- cycles, and then transmitting the remaining packets uncoded. 
Case 2: The resulting graph WC(G') consists of a single 

3- cycle. The cycle must either be clockwise or counter- 
clockwise. Without loss of generality, assume clockwise (see 
Fig. [8j case 2 part 1). Note that: 



P\2 > P21 , P23 > P32 , P31 > Pic 



(34) 



This is because we have formed WC(Q') by removing the 
min-weight link on each of the three 2-cycles. 

Let z = min[Pi2 - P21, P23 - P32, P31 - Pia], so that z is 
a non-negative integer. Without loss of generality, assume the 
min value for z is achieved by link (1, 2), so that z = P12 — 
P 2i . To WC(Q'), add back the link (2, 1) (with weight P 2 i), 
and remove the link (1,2), to yield a graph WC(Q") that is 
an acyclic subgraph of the original graph WC(Q), as shown in 
case 2 part 2 of Fig. [s] The acyclic subgraph WC(Q") contains 
exactly P21 +P23 + P31 packets. Thus, the minimum clearance 
time of the original graph WC(Q) is at least P21 + P23 + 
P31. However, this can easily be achieved. Do the following: 
Perform 2-cycle code actions on each of the three 2-cycles of 
the original graph WC(Q), to remove a number of packets on 
each cycle equal to the min weight link of that cycle. This 
removes 2P21 + 2P32 + IP13 packets in P21 + P32 + P13 slots 



(recall the three min weights are given by |34|). Then perform 
the 3-cycle code action to remove 3z packets in 2z slots. Then 
perform direct transmission to remove the remaining packets, 
being a total of x = P - 2P 2i - 2P 32 - 2P i3 - 3z. The total 
number of transmissions is: 

P21 + P32 + P13 + 2z + x 
= P21 + P32 + Pi3 + 2z 

+P - 2P 21 - 2P 32 - 2P 13 

= P — P21 — P32 — P13 — z 
= P12 + P23 + P31-Z 

= P\2 + p23 + P3I — (p2 — p2l) 
= p21 + -P23 + -P3I 

and so the above scheme is optimal. 



3.3 
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